perf(service): Write HV tombstone before LT data to reduce orphan risk #365
Draft
Force-pushed 1ab7bfe to 9f5bb43
Deduplicates the expiration/mutation-building logic shared between `put_row` and `put_non_tombstone` into a single `write_mutations` method.

Co-Authored-By: Claude <noreply@anthropic.com>
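The shape of that deduplication might look like the following minimal sketch. All names except `put_row`, `put_non_tombstone`, and `write_mutations` are assumptions; the real code builds BigTable client mutations rather than this toy struct.

```rust
/// Illustrative stand-in for a BigTable mutation (fields are assumptions).
#[derive(Debug, PartialEq)]
struct Mutation {
    key: String,
    expires_at_micros: u64,
    payload: Vec<u8>,
}

/// Illustrative stand-in for the table wrapper.
struct Table {
    ttl_micros: u64,
    now_micros: u64,
}

impl Table {
    /// Single place that computes the expiration and assembles the mutation,
    /// so put_row and put_non_tombstone cannot drift apart.
    fn write_mutations(&self, key: &str, payload: Vec<u8>) -> Mutation {
        Mutation {
            key: key.to_string(),
            expires_at_micros: self.now_micros + self.ttl_micros,
            payload,
        }
    }

    fn put_row(&self, key: &str, payload: Vec<u8>) -> Mutation {
        self.write_mutations(key, payload)
    }

    fn put_non_tombstone(&self, key: &str, payload: Vec<u8>) -> Mutation {
        // Tombstone check elided here; only the shared mutation-building
        // path is shown.
        self.write_mutations(key, payload)
    }
}
```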
…tcome

Both types represented the same concept — an operation that either executed or was blocked by a redirect tombstone — with different variant names. Unify them into `ConditionalOutcome { Executed, Tombstone }`.

Co-Authored-By: Claude <noreply@anthropic.com>
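The unified type might look like this minimal sketch (the derives and visibility are assumptions; only the enum name and variants come from the commit message):

```rust
/// Outcome of a conditional storage operation: either the operation
/// executed, or a redirect tombstone at the key blocked it.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum ConditionalOutcome {
    Executed,
    Tombstone,
}
```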
…ibility Keep the ConditionalOutcome rename from this branch while adopting the pub visibility from main.
Previously, large-object inserts followed LT-first ordering: write data to long-term storage, then write the redirect tombstone to high-volume. Concurrent inserts or pod kills between those two steps left an orphaned long-term object — data in LT with no tombstone in HV — permanently unreachable with no recovery path.
This flips the ordering to HV-first: write the tombstone first, then write the data. A failure between the two steps now produces a headless tombstone (tombstone in HV, no data in LT) instead. Headless tombstones are safe and self-healing: reads return `None`, deletes remove them, and re-inserts overwrite them.

The tradeoff is deliberate: we lower the risk of orphans in long-term storage — which are silent data leaks with no recovery — and instead accept headless tombstones, which are a well-defined, recoverable state.
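The ordering and the read-side behavior can be sketched as follows. This is a toy model, not the real service: plain `HashMap`s stand in for the HV and LT stores, a `None` cell stands in for the tombstone, and the crash flag simulates a pod kill between the two writes.

```rust
/// HV-first large-object insert: tombstone first, data second.
/// Function and store shapes are illustrative, not the real API.
fn insert_large_object(
    hv: &mut std::collections::HashMap<String, Option<Vec<u8>>>, // None = tombstone
    lt: &mut std::collections::HashMap<String, Vec<u8>>,
    key: &str,
    payload: Vec<u8>,
    crash_between_steps: bool, // simulate a pod kill between the two writes
) {
    // Step 1: write the redirect tombstone to high-volume storage first.
    hv.insert(key.to_string(), None);
    if crash_between_steps {
        // Worst case is now a headless tombstone, never an orphaned LT object.
        return;
    }
    // Step 2: write the data to long-term storage.
    lt.insert(key.to_string(), payload);
}

/// A read that hits a tombstone follows the redirect to long-term storage;
/// if the LT object is missing (headless tombstone), it safely returns None.
fn read(
    hv: &std::collections::HashMap<String, Option<Vec<u8>>>,
    lt: &std::collections::HashMap<String, Vec<u8>>,
    key: &str,
) -> Option<Vec<u8>> {
    match hv.get(key) {
        Some(Some(data)) => Some(data.clone()),
        Some(None) => lt.get(key).cloned(), // redirect; None if headless
        None => None,
    }
}
```

A re-insert of the same key simply overwrites the headless tombstone and completes both steps, which is the self-healing property described above.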
For small objects, a new `put_non_tombstone` trait method atomically rejects the write if a tombstone already exists at the key, routing the payload to long-term storage instead. BigTable implements this with `CheckAndMutateRowRequest`; other backends fall back to a non-atomic read-then-write.

One gap remains: a concurrent insert + delete can still race to produce an orphaned long-term object. Fixing that requires per-key serialization. This PR is an intermediate improvement; the full solution with no orphans at all is tracked separately.
Ref FS-236
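The trait-level shape of `put_non_tombstone` with its non-atomic default might look like the sketch below. The trait name, `Cell` representation, and `PutOutcome` type are assumptions; only `put_non_tombstone`, `put_row`, and the fallback semantics come from the description, and the real interface is presumably async.

```rust
use std::collections::HashMap;

/// Illustrative cell type: a row holds either data or a redirect tombstone.
#[derive(Debug, PartialEq)]
enum Cell {
    Data(Vec<u8>),
    Tombstone, // redirect to long-term storage
}

#[derive(Debug, PartialEq)]
enum PutOutcome {
    Written,
    BlockedByTombstone,
}

trait HighVolumeStore {
    fn get(&self, key: &str) -> Option<&Cell>;
    fn put_row(&mut self, key: &str, value: Vec<u8>);

    /// Default, non-atomic read-then-write fallback: refuse the write when a
    /// tombstone is present. BigTable overrides this with a single
    /// CheckAndMutateRowRequest so the check and the write are atomic.
    fn put_non_tombstone(&mut self, key: &str, value: Vec<u8>) -> PutOutcome {
        if matches!(self.get(key), Some(Cell::Tombstone)) {
            return PutOutcome::BlockedByTombstone;
        }
        self.put_row(key, value);
        PutOutcome::Written
    }
}

/// Toy in-memory backend that just takes the default implementation.
struct MemStore(HashMap<String, Cell>);

impl HighVolumeStore for MemStore {
    fn get(&self, key: &str) -> Option<&Cell> {
        self.0.get(key)
    }
    fn put_row(&mut self, key: &str, value: Vec<u8>) {
        self.0.insert(key.to_string(), Cell::Data(value));
    }
}
```

On a `BlockedByTombstone` result, the caller would route the payload to long-term storage instead, as the description states. The default implementation is where the race window lives: between `get` and `put_row`, a concurrent delete or insert can change the row, which the atomic BigTable override avoids.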